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Abstract 

We view the RSK correspondence as associating to each permutation 
IT £ S„ a Young diagram A = A(7r), i.e. a partition of n. Suppose now 
that TT is left-multiplied by t transpositions, what is the largest number of 
cells in A that can change as a result? It is natural refer to this question 
as the search for the Lipschitz constant of the RSK correspondence. 

We show upper bounds on this Lipschitz constant as a function of 
t. For t = 1, we give a construction of permutations that achieve this 
bound exactly. For larger t we construct permutations which come close 
to matching the upper bound that we prove. 

1 Introduction 

The Robinson-Schcnstcd-Knuth (RSK) correspondence [TOl [Hj [7] maps an ar- 
bitrary permutation tt e Sn bijectively to an ordered pair of Young tableaux 
of the same shape A — X{tt). How much can A(7r) change as we mildly vary 
TT? For example, if we left-multiply tt by ^ transpositions, to what extent can A 
chang(^ We begin with the case when t — 1 and show that the resulting Young 
diagram can differ from A on at most \/n/2 cells. We show that this bound is 
tight by giving explicit constructions of permutations tt for which this bound is 
attained where the diagrams differ in at least (1 — o(l))-\/n/2 cells. We then 
turn to consider the same question for larger t and show that the corresponding 
diagram changes in at most 0[\Jnt Int) cells. The best constructions we know 
nearly match this bound and yield, e.g., (1 — o(l))-\/nt/2 changes for t — o(n). 
The outline of this paper is as follows. In the remainder of this section we recall 
some definitions and properties of Young tableaux and the RSK correspondence. 
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In Section [2] we prove upper bounds on the Lipschitz constant when t = 1 and 
show a matching construction. In Section [3] we give upper bounds and extend 
our constructions for the case of general t. We conclude with some directions 
for further research in Section |4l 

1.1 Notation and Preliminaries 

We recall some definitions and background on Young Tableaux and the RSK 
algorithm here. For more detailed expositions refer to [H [S] or [T^ . 
Let n G N be a positive integer. A vector A = (Ai, A2, . . . ) of positive integers 
is a partition of n (denoted by A h rt) if 

Ai > A2 > • • • > and ^ A,; = n. 

i 

The Young diagram (or diagram) of a partition A is a left-justified array of cells 
with Ai cells in the i-th row for each i > 1. For example, the diagram of the 
partition (5, 5, 3, 2) is 



The cell in the i-th row and j-th column is referenced by its coordinate {i,j). 
Thus (1,1) is the top leftmost cell of the diagram. 

The conjugate of a partition A, denoted by A' is the partition whose diagram is 
the transpose of the diagram of A. 

A standard Young tableau (SYT or tableau) of size n with entries from [n] is a 
diagram whose cells are filled with the elements of [n] in such a way that the 
entries are strictly increasing from left to right along a row as well as from top 
to bottom down a column. The shape of a tableau T, denoted sh(T) is the 
partition corresponding to the diagram of T. For example. 
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is a tableau of size 7 of shape (4,2, 1). Note that the elements in the cells of a 
SYT are distinct integers. Let Tn denote the set of SYT of size n. 

1.2 The Robinson-Schensted-Knuth (RSK) Correspondence 

The RSK correspondence discovered by Robinson Schensted 11 and fur- 
ther extended by Knuth 7J is a bijection between the set of permutations Sn 
and pairs of tableau of size n of the same shape. This bijection is intimately 
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related to the representation theory of the symmetric group [51 13] , the theory 
of symmetric functions [12 Chapter 7] , and the theory of partitions . 
The bijection can be defined through a row-insertion algorithm first defined 
by Schensted [TT] in order to study the longest increasing subsequence of a 
permutation. Suppose that we have a tableau T. The row-insertion procedure 
below inserts a positive integer x that is distinct from all entries of T, into T 
and results in a tableau denoted hy T ^ x. 

1. Let y smallest number larger than x in the first row of T. Replace the cell 
containing y with x. If there is no such y, add a cell containing x to the 
end of the row. 

2. If y was removed from the first row, attempt to insert it into the next row 
by the same procedure as above. If there is no row to add y to, create a 
new row with a cell containing y. 

3. Repeat this procedure on successive rows until either a number is added 
to the end of a row or added in a new row at the bottom. 

The RSK correspondence from S'„ to {{P,Q) e %. x Tn ■ sh{P) = sh{Q)} 
can now be defined as follows. Let tt e S'„ and let tt^ denote the element of [n] 
in position i in tt. Let Pi be the tableau with a single cell containing tti. Let 
Pj = Pj-i ^ TTj for all 1 < j < n and set P = Pn- The tableau Q is defined 
recursively in terms of tableaux Qi of size i as follows. Let Qi be the tableau 
with one cell containing the integer 1. The equality of shapes sh{Qi) ~ sh{Pi) is 
maintained throughout the process. The cell of Qi containing i is the (unique) 
cell of Pi that does not belong to Pi^i. The remaining cells of Qi are identical 
to those of Qi-i- Finally, set Q = Q„. We refer to P as the insertion tableau 
and Q is the recording tableau. 

Let TT e Sn and let (P, Q) be the corresponding tableaux under the RSK corre- 
spondence. The shape of tt is sh{P) = sh{Q) and will be denoted by A = A(7r). 
The RSK correspondence has numerous interesting properties (see [H E] or 
[12]). Some that will be useful in particular are as follows. 

Proposition 1.1. Let A — A(7r). Then the diagram corresponding to tt^' , the 
reversal o/tt, is A', the conjugate of X. 

Proposition 1.2. Let (P, Q) be the tableaux corresponding to a permutation tt 
under the RSK correspondence. Then the tableaux corresponding to the inverse 
permutation vr^^ are {Q,P). Thus the shape remains invariant upon inversion, 
i.e., A(7r-i) = A(7r). 

1.3 Motivation and Related Work 

In view of the important role of the RSK correspondence, it is natural to in- 
vestigate various aspects of it. Thus Fomin's appendix in 12, Chapter 7] starts 
with the following two motivating questions: 

(1) Given a partition A, characterize those permutations tt for which A(7r) = A. 
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(2) Given a tableau P, characterize the permutations tt which have P as their 
insertion tableau. 



We consider an approximate version of such questions and ask to what extent A 
changes as tt changes slightly. Question (1) is answered by the following theorem 
of Greene. 

Theorem 1.3 (Greene [5j). Letir be a permutation, and suppose that the largest 
cardinality of the union of j increasing subsequences in ir is fij, then A(7r) = 
Ai, . . . , Afc, where Ai = fix and Xj = fij — for all j > 2. 

In his study of the RSK correspondence, Knuth discovered certain equivalence 
relations that are key to the solution of Question (2) above. Two permutations 
are Knuth equivalent if one can be obtained from the other by certain restricted 
sequences of adjacent transpositions. Knuth equivalent permutations are the 
equivalency classes of permutations that have the same insertion tableau. For 
more on the subject, see \n\ . 

In order to make our question concrete, we need to specify two measures of 
distance: One between permutations and the other between diagrams. A natural 
metric on permutations is left-multiplication by adjacent transpositions. An 
adjacent transposition is a permutation of the form (z, i + 1). Left-multiplying 
TT by an adjacent transposition is denoted by (z, i + 1) o vr and means that first, 
the permutation tt is applied and then the transposition. We denote the least 
number of adjacent transpositions that transform the permutation tt to r by 
d(7r,r). Recall that d{-, •) is the graph metric in the Cayley graph of Sn w.r.t. 
the generating set of adjacent transpositions (1, 2), (2, 3), ... , {n, n — 1). We will 
say that two permutations tt and r are at distance t if d(7r, t) — t. If A and ^ 
are two diagrams, define their distance to be 

n 

A = A(A,Az) :=-^|A,-M.|- 

1=1 

Let TT and r be any two permutations. We are interested in the Lipschitz 
constant of this mapping, i.e., 

w ^ A(A(7r),A(T)) 

Lin, t) max ^ ), ) " 
a(7r,r) 

where the maximum is over all t:,t Sn with d^n, r) = t. 

The choice of left-multiplication above is in fact without loss of generality. By 
Proposition |1.2| the shape of a permutation and its inverse under the RSK 
correspondence are the same. Our results thus all follow immediately for right- 
multiplication since t = {i,i + 1) o n is equivalent to vr"^ = o (z, i + 1). 
In general, although d and A are natural metrics to study for permutations 
and diagrams respectively, the same question can be asked for other metrics. 
We discuss the extension of our results to other metrics on permutations in 
Section m 
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2 Exact Bounds on the Lipschitz Constant for a 
Single Transposition. 

In this section we will show upper bounds on the Lipschitz constant when the 
number of transpositions t = 1. We also give a construction of a family of 
permutations which achieve this bound asymptotically. 

2.1 Upper Bounds 

The first step of the proof is to show that left-multiplying a permutation by 
a transposition can result in only a bounded number of cells being different in 
each row of the diagram. 

Proposition 2.1. Let tt, t e S'„ and let A, fi be the respective diagrams. Suppose 
that T — {i^i + I) o IT, and TTi < TTi+i . Then, 

j 3 3 

i—l i—1 i—1 



Proof. Suppose that the largest cardinality of the union of j increasing subse- 
quences in TT is i. Suppose there is a subsequence which includes the pair that is 
being transposed in tt. By deleting one of the elements of the pair we obtain a 
set of j increasing subsequences of t whose cardinality is at least € — 1. If there 
is no such subsequenc e, th en the same j subsequences are also increasing in t. 



By Greene's Theorem 1.3 this implies — Mi + 1- 

For the lower bound, consider the largest cardinality of the union oi j increasing 
sequences in r. No subsequence in this union can contain both of the elements 
involved in the transposition. Since the pair involved in the transposition have 
no other elements between them in both tt and r the subsequences are also 
increasing in tt. We conclude in the same way that M* — X]i=i ^i- ^ 

Figure [l] will be useful in the following discussion. It depicts the union of two 
diagrams A and /i, which is also a Young diagram. The symmetric difference 
consists of the cells marked by a dot. The remaining set of cells of the diagram 
labeled W is the intersection of A and and this is a Young diagram as well. 

Corollary 2.2. Let tt, r G Sn where d^n, t) — 1 and let A, /i be the corresponding 
diagrams. Then at most one cell in each row and each column of the union of 
A and fi can be in the symmetric difference. 

Proof. To see this for a row r, consider inequality ([I]) for j ^ r and for j = 
r — 1 and take their difference. A similar argument applied to the reversed 
permutations implies the claim for columns, (see Proposition □ 
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Figure 1: The union of A and fi with cells of the symmetric difference marked. 



Theorem 2.3. Let tt and t he permutations in Sn with respective Young dia- 
grams A and /i, and suppose that d{-K, r) = 1. Then 



Proof. As shown in Figure [T] let ii,j) and {i',j') be the coordinates of two 



distinct cells in the symmetric difference. By Corollary 2.2 i ^ i' and j ^ j', 
and (min(i, z'), min(j.j')) g W. This gives a 1:1 map from unordered pairs of 
cells in the symmetric difference into W . Therefore, 



<n~ A 



implying the required bound 



2.2 Construction 



A<w|. □ 



In this section we construct pairs of permutations in Sn which differ by a 
single transposition whose corresponding Young diagrams dif fer by at least 



(1 — o(l))-y/n/2 cells, matching the upper bound in Theorem 3.2 asymptoti- 
cally. The following lemma characterizes the shape of a permutation by the 
cardinalities of increasing and decreasing subsequences. 

Lemma 2.4. Let t: & Sn be a permutation whose elements can be decom- 
posed in the following two ways: (i) into increasing subsequences of cardinali- 
ties Ai, A2, . . . , and (ii) into decreasing subsequences of cardinalities A'j^, A2, . . . , 
where the partitions A and A' are conjugate. Then A = A(7r). 



Proof. By Greene's Theorem |1.3| it suffices to show that for each r, the largest 
cardinality of the union of r increasing sequences in tt is X]i<r assumption 
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we know it is at least this number and we need to show the opposite inequality. 
Namely, that if si, . . . , Sj is a collection of disjoint increasing sequences in tt, 

J2i=l ^ J2i<r "^i- 

By assumption, there is a decomposition di, ^2 • ■ ■ of tt into disjoint decreasing 
subsequences of cardinalities A'j^ , , . . . . But each Sj and dj can have at most 
one element in common, so that 

r r 

$^|sd= |s.nd,| <^min{|d,|,r}-^min{A;-,r}-^A, 

2—1 r>2>l, j j j i—1 

where the last equality follows because the partitions Ai, A2, . . . and A'j^, A2, . . . 
are conjugate. □ 

Theorem 2.5. For every n there are permutations tt, r G S'„ with d(TT,T) — 1 
and respective shapes A, /i such that A(A,/i) > (1 ^ o(l))^n/2. 

Proof. Our proof says, in fact, a little more than what is stated. Namely for 
n = {k + 1)^/2 with k an odd integer, we will construct two permutations tt and 
T of shapes A = (fc + 1, fc — 1, fc — 1, . . . , 2, 2) and ^ = (fc, fc, fc — 2, fc — 2, . . . , 1, 1) 
which differ by exactly one cell in each row and column, giving A — y^n/2. 
Thus it can be verified that together with Theorem |2.3| this gives a complete 
answer to our question for n of this form. For other values of n we get the result 
by padding this basic construction. In the discussion that follows we decompose 
these permutations into monotone subsequences. The decompositions we exhibit 
are not necessarily unique, but for our purpose any decomposition suffices. 
The construction can, perhaps, be best understood by observing alongside with 
the general discussion a concrete special case. So we intersperse our general 
constructions with an illustration that shows how things work for n = 18 (fc = 
5). We start by dividing the elements of [n] into three categories according to 
their magnitude. The "small" elements are those in the interval [l,n/2 — ^y^]. 
The next fc + 1 elements, i.e., interval [n/2 — ,11/2+ are "intermediate" 
and members of the interval [n/2 + ^i^,n] are "big". 

We further subdivide the big elements (in order) into blocks 61, . . . , fo(fe_i)/2- 
The small elements are split (in order) into blocks S(a;-i)/2, . . . , si. Both Si and 
bi have cardinality 2i. 



S2 si bi 62 

(1 2 3 4) (5 6) 7 8 9 10 11 12 (13 14) (15 16 17 18) 

small intermediate big 

The permutation tt is constructed by spreading out the intermediate elements 
with n/2 and n/2 + 1 remaining fixed points (see below). The blocks of big 
elements are then inserted in the order 6(fc_i)/2, . . . , 61 in the spaces between the 
smaller intermediate elements while the blocks of small elements are inserted in 
the order si, . . . , S(fc_i)/2 in the spaces between the larger intermediate elements. 
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To obtain r we apply the transposition (n/2, n/2 + 1) to tt. The permutations 
are defined in this manner with a view to decomposing them into increasing and 
decreasing sequences of desired cardinaUties. 



TT = 7(15 16 17 18) 8 (13 14) 9i0 (5 6) 11 (1 2 3 4) 12 
T = 7(15 16 17 18) 8 (13 14) 10 9 (5 6) 11 (1 2 3 4) 12 



From the construction we claim that n and r can be decomposed into a disjoint 
union of increasing subsequences of cardinalities (fc + 1, fc — 1, fc — 1, . . . , 2, 2) and 
{k,k,k — 2, k — 2, ... ,1,1) respectively. For tt the increasing sequences consist 
of (i) The intermediate elements, which in our example is 7, 8, 9, 10, 11, 12, (ii) 
The blocks of small elements, i.e., 1,2,3,4 and 5,6 and (iii) The blocks of big 
elements, i.e., 15, 16, 17, 18 and 13, 14. 

The permutation r can be decomposed into the increasing subsequences of the 
following three types: (i) An intermediate element and the block of big ele- 
ments following it, which in the example are 7, 15, 16, 17, 18 and 8, 13, 14, (ii) 
A block of small elements and the following intermediate element, i.e., 5,6,11 
and 1, 2, 3, 4, 12 and (iii) The two subsequences of length one consisting of one 
of the two middle intermediate elements, i.e. 10 and 9. 

The proof that tt and r have the shapes A (fc + 1, fc — 1, fc — 1, . . . , 2, 2) and 



11= {k,k,k — 2, k — 2, ... ,1,1) respectively uses Lemma 2.4 It is enough to 
decompose tt and r into a union of decreasing sequences whose cardinalities 
are given by the respective conjugate sequences. Note that as it happens, the 
shapes A and fi are conjugates. 

We assign the elements of tt to decreasing subsequences di, . . . , dfc+i of cardinal- 
ities fc. A:, . . . , 1, 1. as follows. Since the di are subsequences, elements in them 
appear in the same order as in the permutation. Secondly, the assignment is 
made so that each subsequence has exactly one of the intermediate elements, 
and it appears after any of the big elements and before any of the small ele- 
ments. (There is more than one way to do this.) We first see how this is done 
in the example. 

We first construct di and d2 which are both decreasing sequences of length 5. 
The largest elements in the blocks bi, the largest elements in the blocks Si and 
one of the middle intermediate elements are assigned to di. Then we choose d2 
similarly from among the remaining elements. 



di : 7(15 16 17 [is]) 8 (13 [u]) [d] 10 (5 [&]) 11 (1 2 3 0) 12 



d2 : 7(15 16 [l7]>g) 8 ([IsjX) ^ ^ ([5] ^) 11 (1 2 g 12 



The remaining elements can be seen to have the same structure recursively (the 
remaining elements appear in the same relative order as would the elements of 
the permutation for n = 8), where the brackets indicate blocks of big and small 
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elements as before. 



7(15 16) 8 11 (1 2) 12 

To assign elements to dz and ^4, we want to continue with the strategy of 
choosing the largest elements that remain in the blocks. Note that since the big 
elements 13 and 14 have been assigned, there are no big elements that follow 
the element 8, and it now becomes "available" . Thus ds and ^4 are constructed 

by assigning the largest elements that remain in the small and big blocks and 
one of the remaining intermediate elements in the middle of the blocks. 

da : 7(15 [mjX ^ ) [s] 0^ M) ^ (U) 11 (1 @^4)12 



d4 : 7(^ >€r X>g) ^ i^M) 1>>^ 1^ f) 12 



Proceeding the same way, we obtain the subsequences: d\ = 18, 14, 9, 6, 4, d2 = 
17,13,10,5,3, ds = 16,8,2, di = 15,11,1, ds = 7, de = 12. In general, the 
assignment is done as follows. 

• The i-th largest element in each block of big elements, is assigned to the 
subsequence di. 

• The j-th largest element in each block of small elements, is assigned to the 
subsequence dj. 

• For the intcamediate elements, assign the lower (k + l)/2 elements to 
the subsequences dk,dk-2, ■ • ■ - di (in that order) and the top {k + l)/2 
elements to ^2, • ■ • , dfc-i, dfe+i (in that order). 

Clearly, this is a decomposition of [n] with exactly k — 2[{i — 1)/2J elements 
in dj. It remains to show that each di is a decreasing subsequence. By the 
construction of the permutation, the big and small elements in di form a de- 
creasing subsequence since each of them is from a different block. Secondly, the 
intermediate element in dj appears after all the big elements and before any of 
the the small ones. 

Similarly, for the permutation r, we define the decreasing subsequences fi, . . . ,fk 
of cardinalities fc + 1, fc— 1, fc — 1, . . . . 2, 2, where \ fi\ = k + 1 — 2[i/2j . As before, 
the assignment is made so that each sequence but for one (which has the two 
middle intermediate elements) has at most one intermediate element, and at 
most one element from each of the small and the big blocks. In our example, 
we construct /i, a subsequence of length 6, by taking the largest element from 
each block and the two middle intermediate elements. 



/i : 7(15 16 17 [18]) 8 (13 \Uj) [lO] [9] (5 [e]) 11 (1 2 3 0) 12 



Next, we choose /2 and /a which are both subsequences of length 4. At this 
point, we cannot continue to follow the strategy of assigning the largest elements 
from each block to /2 (by choosing 17, 13, 5, 3) as in the next step we would fail 
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to construct /a of length 4. Instead, note that when only one element remains in 
a block of small elements, the intermediate element which follows that block has 
not yet been assigned and it does not follow any other small elements. Thus the 
strategy for /2 is to assign to it the largest elements from all blocks except from 
Si in which only one element remains, and to assign the intermediate element 
following si to /2. To construct /s, we take the largest remaining elements in all 
the blocks, and the intermediate element that precedes the block of big elements 
whose smallest element was assigned to f2- Diagrammatically, we have: 



/2 : 7(15 16 [l7]>g) 8 (^X)>0'^(5 ^) ^ (1 2 [s] /) 12 



/3 : 7(15^ >r i^gO^ X (10 ^ )i) 12 



Repeating the same arguments for the remaining elements, we obtain the fol- 
lowing subsequences for the example: /i — 18,14,10,9,6,4, /2 = 17,13,11,3, 
/a = 16,8,5,2, /4 — 15,12, ~ 7,1. In general, the subsequences can be 
defined as follows. 

• The i-th largest element in each block of big elements is assigned to fi . 

• The smallest element in a block of small elements Sj is assigned to f2j+i- 
Among the remaining elements, the i-th largest element goes to fi. 

• The lower {k — l)/2 of the intermediate elements go to f^, fk-2, ■ ■ ■ , fa (in 
that order). The top {k — l)/2 elements to /2, . . ■ , fk-i (in that order). 
The two middle intermediate elements are in /i. 

As before, the fi constitute a decomposition and they have the appropriate 
sizes. By construction, the big and small elements in any subsequence fi form 
a decreasing subsequence. Lastly, for i ^ 1 there is at most one intermediate 
element in fi and if one exists, it appears after all the big elements and before all 
the small ones. For i = 1, the two intermediate elements appear consecutively 
in decreasing order, after all big elements and before all small ones. Thus, tt 
and T have the claimed shapes and it follows that 

For n not of the form (fc + 1)^/2, we construct two permutations as follows. Let 
no < n be the largest integer such that no = {k + 1)^/2 for odd k. The first no 
elements of tt and r are set according to the construction above on no elements. 
The last n — no elements of both tt and r are no + 1, . . . ,n. Then, we have that 

A = ;f^(i-»a))\/|. 

We have carried out computer simulations and found other pairs of permutations 
for which the bound holds with equality. Several mysteries remain here, a few 
of which we mention in Section [H 
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3 Bounds on the Lipschitz Constant for t > 1 



In this section we show bounds on the Lipschitz constant for t > I. Extending 
the arguments from the previous section for both the upper and lower bound 
gives bounds that are tight up to constant factors for t — 0{1). In the latter half 
of this section we give a more complicated argument that yields an improved 
upper bound for general t. 

3.1 A construction for permutations at linear distance t. 

The construction for the case of one transposition can be extended to the case 
of more than one transposition as follows. 

Theorem 3.1. Let t < n/2. For every n there are permutations Tr,T € Sn with 
d(7r,r) = t and respective shapes X, fi such that A(A, /i) > (1 — y^t/2n)y^nt/2. 

Proof. Let k = l^/2nji — Ij, and m = {k + 1)^/2 so that mt < n. Divide 
the first mt elements of [n] into t blocks of length m each. To construct the 
permutations, in each block we permute the elements as in the construction for 
one transposition, and then concatenate the blocks with the remaining n — mt 
elements following. Then, it is not difficult to see that the RSK algorithm on 
this pair of permutations will result in a shape with t of the smaller Young 
diagrams corresponding to each block being pasted one after the other, with an 
additional n — mt boxes in the top row of each diagram. Then, A = t.Jm/2 > 
yjnt/2{l - ^Jtj2n). Thus when t = o(n), A > (1 - o{l))^nt/2. □ 

3.2 Upper Bounds 

We start with an easy observation: 

Theorem 3.2. Let tt and r be permutations in Sn such that d{T:,T) = t. Let A 
and II be the respective Young diagrams. Then 

A = A(A,/i) < t 

Proof. Since d(7r, r) = t, there is a sequence of permutations tt — ctq, CTi, . . . , Ci = 
T such that for each < i < t, ai and (Ti+i differ by an adjacent transposition. 
The distance A(-, •) is a metric on diagrams and hence the bound follows by the 
triangle inequality from Theorem |2.3[ □ 

We do not see how to appropriately adapt the bijective argument of Theorem 
|2.3| However, the following argument yields a near-optimal bound. 

Theorem 3.3. Let tt, r G 5„ be such that d{'K, r) = t. Let A, /i be the corre- 
sponding diagrams. Then 

A(A,^) < 0{\/nt \iit). 
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A-block 
/^-block 



Figure 2: The union of A and and the symmetric difference split into blocks. 



We start by showing some preliminary results that will be useful in the proof. 
Suppose TT and r are two permutations such that d(7r, r) = t. Let n — cTQ,ai, . . . , 
at = T he a sequence of permutations such that for each < i < i, ct^ and ct^+i 
differ by an adjacent transposition. Say that s (resp. r) of the transpositions 
put the relevant pair in decreasing (resp. increasing) order, where t = r + s. 
Let A and /i be the diagrams corresponding to tt and r respectively. 

Lemma 3.4. Let 7r,T be as above. Then, 



(2) 



Proof. For each pair ct^, ai+i in the sequence of permutations tt = ao^ai, . . . ,<7t = 
T, by Proposition 2.1 the inequality ([T]) holds for the diagrams corresponding to 
ai and (Ti+i. The result is obtained by adding up all these inequalities. □ 

In Figure[2]we depict the union of two diagrams. Their intersection is labeled W 
as before. We split the symmetric difference of the two diagrams into blocks. We 
say that j indexes a A-row if A^ > fij- A maximal interval of A-rows determines 
a X-pre-block. A maximal collection of consecutive A-pre-blocks constitutes a 
X-block. We likewise define /i-blocks. Blocks are labeled Bi as in the figure. The 
number of cells in a set S will be denoted by A{S). We use the following fact 
about the sizes of the blocks. 

Proposition 3.5. Let d{TT,T) = t with corresponding diagrams X, fi and let B 

he a block in the union of the diagrams, then A(B) < t. 
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Proof. This bound is obtained from Lemma 3.4 as follows. Let B reside in the 
set of rows / of the diagram. Assuming it exists, let zq be the row just preceding 
/, and ii = max/. Then the bound is obtained by subtracting the inequality 
([2| corresponding to j = io from the inequality corresponding to j = ii, and 
using the fact that r + s = t. If there is no row ip, then the bound is immediate 
from the inequality for j = ii- □ 

The main step in the proof of Theorem |3.3| is the following lemma about two 
sequences of integers. 

Lemma 3.6. Let k > 2, T > 3 and let ai, . . . , Ok and bi, . . . ,bk be two sequences 
of positive integers. Denote A = X]i=i '^i^i '^'^'^ ^ ~ ^i<i<j<k o-i^j- If 

ai = bk = I and \fi, Oibi < T, 

then 

A < y/-i2NT\nT. 
This bound is tight up to constants. 

We first show how to derive the theorem from Lemma 13.61 
Let A and be two diagrams of size n (not necessarily corresponding to per- 
mutations at distance t). For the union of these diagrams, define the blocks of 
the symmetric difference {Bi] and W as before. Suppose that for each block i?. 



A{B) < t. To prove Theorem 3.3 it is sufficient to show that for these diagrams 



tint 



A{W) + \Y,A{B,) 



(3) 



With this formulation in mind, we can make the following assumptions about 
the pair of diagrams. The aim is to make a number of transformations and show 
that the pair of diagrams can be assumed to be of the form shown in Figure [6j 

Reduction 1. For any row i, Xi ^ fj,i and similarly, for any column j , X'j ^ fj,'j. 
If this is not the case (as in the shaded part of Figure^, we delete such rows or 
columns from both A and ^. Consequently, A{W) decreases, whereas 
remains unchanged. Thus, if the bound holds for the new pair of diagrams, it 
holds as well for the old pair. 

Reduction 2. In general, each block is a skew-diagram (the set theoretic differ- 
ence of a diagram and another contained in it). However, as we show, we may 
assume it is a Young diagram. The dotted lines in Figure mark the "shade " 
of a block in W determined by its top row and leftmost column. If a block is not 
a (left- aligned) tableau, we can change it to one by removing the cells of W in 
its shade and replacing it with a Young diagram of area A{B) contained in the 
union of the block and its shade. 
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Figure 3: Rows and columns of W that may be removed. 



This transformation decreases A(Vl^) and keeps the size of the block fixed. Sec- 
ondly, we may assume that the transformation is done so that all rows of a 
block, with the possible exception of the last one have the same length. The 
result of such a transformation on the blocks B2 and B3 is shown in Figure [5j 

Reduction 3. We may assume that the topmost block Bi has a single row. 
Otherwise, we can shift all the cells of Bi to the first row without changing any 
A{Bi) or A(W). We can then delete any rows of W which are of the same 
length in A and /i. By similar reasoning, we may assume that the bottommost 
block has a single column. 

Thus, we may assume that the diagrams are as shown in Figure |6] and that the 
sizes of the blocks are bounded by t. As in the figure, let and hi denote the 
lengths of the vertical and horizontal sides of the rectangle which bounds the 
block Bi. Thus the area can be written as a sum of areas of rectangles 

Qibj whose sides are determined by the side lengths of pairs of blocks. Also note 
that by our construction of the blocks, Oibi < 2t. 

To obtain the formulation of the lemma, suppose that we add cells to the last row 
of each block to complete it to a rectangle. Denote the modified blocks by B' . 
Then for each block, A{B') < 2A{B). If we show the bound for these modified 
diagrams with a bound of 2t for each block, then the bound is implied for the 
original diagrams since the constants can be absorbed by the O(-). Formally, 
this follows from the following inequalities. 

2. A{W) + \ A{B';) < 2 {A{W) + \ A{B,)) 
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B 

3 



B 

4 



Figure 4: The top left corner of the block. 



Figure 5: All blocks are Young diagrams. 
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b b 

4 , } 



Figure 6: A box of side lengths and hi bounds Bi 



Thus Lemma 3.6 imphes the bound ([3| for a pair of diagrams as above and we 
have verified that to prove Theorem 3.3 it is sufficient to prove the lemma. 



Proof of Lemma 



3.6 



We will minimize iV/A^. For fc = 2, the lemma can be 
easily verified by calculation once we use the fact that — — ^- Thus we 
will assume that A: > 3. Consider the following relaxation of the minimization 
problem where the a^, hj are not necessarily integral. 



E 

N i<i<j<k 



ciibj 




S.t. fli = fefc = 1 

o-ibi < r, 1 < i < k 
a,; > 1, 2<i<k 
6, > 1, 1 < i < fc - 1 



We will use the method of Lagrange multipliers (see Appendix |A] for a brief 
introduction) to obtain a lower bound on the value of the objective above at any 
local optimum. Since the problem is a relaxation of the discrete minimization 
problem, this also lower bounds the objective of the discrete problem. We obtain 
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the following Lagrangian for the relaxation above. 

^ k k k 

min£ - ^Xi{aih - T) ~ ^/ii(aj - 1) - ^Viibi - 1) 

2 — 1 i—1 2 — 1 



The Karush-Kuhn- Tucker conditions yield the following necessary conditions 
for minimality. 

TT-'^^TT-^-^'^^-'^^^O' l<i<k (4) 
d d N 

A, > 0, A,(T - a^bi) =0, 1 < i < /fc 

> 0, fi^ia, - 1) = 0, 2<i<k 

> 0, v,{bi - 1) = 0, 1 < i < fc - 1 (6) 

From these conditions, we can show that at optimality either aibi = T or 1. 
Suppose that for some i, a^fei < T. Note that by the conditions above, this 
implies = 0. Now, if aibi ^ 1, at least one of or bi is > 1. Assume without 
loss of generality that bi > 1 (the argument in the other case is exactly the 
same). In this case t'i = by (|6]) . Hence from ([5| above, we have 

and therefore, since A > 

A^— =2AiV— 
obt db^ 



j=i_ ^ N 
2a, A 



(7) 



Now we show that it is possible to increase bi by a factor (1 + e) for e > so that 
N/A'^ decreases and we can conclude that the solution is not optimal. This is 
allowed, at least for e > small enough, since, by assumption aibi < T. Let N' 
and A' be the summations as defined before for the sequences where we replace 
h by b,{l + e). 

i 

N + bie^ aj 

N' 

(A')2 (A + aAsr 
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To prove the claim N' /{A')'^ < 7V/A^, using the right hand side above, it is 
enough to show that 

i 

A'^he'^aj < N {2AaAe + {a^bief) 
Or equivalently, dividing throughout by 2aibieA'^, that 

i 

1=1 N NuibiS 

< 1 —. 

2a, A 2A2 

This inequality follows by ([7|. The left-hand term equals the first term on the 
right and e > 0. 

The next step is to argue that it is enough to show the claimed bound assuming 
that the blocks are arranged in a specific manner (i.e., the sequences are of a 
certain form). In particular, the blocks of area T are arranged such that is 
increasing and bi is decreasing. Secondly, the blocks of area 1 occur after all 
blocks such that a,; < bi and before all blocks such that > bi. This can be 
argued by noticing that such an arrangement can be achieved by exchanging 
blocks which are out of order since A remains unchanged and N does not in- 
crease. Thus a lower bound on iV/A^ for the modified sequence is a lower bound 
on the corresponding quantity for the original sequence. 

We next argue that, in fact, w.l.o.g. no block has area 1. Recall that we wish 
to show 

32TlnT7V- A^ > 0. 
We will show that if we add a single block of area 1 then 

32r In TN' - A'2 > 32T In TN - A^ (8) 

where A^' and A' are the modified values of N and A. This inequality above 
allows us to reduce the argument to the case when there are no blocks of area 
1. Let the shorter sequence have k terms. Note that A' = A-|- 1 and the change 
in the number of cells A^' — is at least iiiiii(ai, ^i)- 

Recall that fc, T > 3 and for 1 < i < fc, a^, bi > 1. Thus, 

(A')^-A2= 2A + 1 

< 2kT + 1 

< 32fcrinr 

k 

< 32nnT min(ai, 6i) 

1=1 

= 32rinT(A^'- A^) 
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which imphes the required inequahty (|8|. 

In the next step, we will make a further simphfication to the picture. To summa- 
rize, we now know that we may optimize over sequences such that each block has 
size T, oi = 6fc = 1, the sequence is non-decreasing and bi is non- increasing. 
The claim is that the optimal solution is of the form where there is some i such 
that fli > 1 and bi > 1. If not, then it can be checked that A — \/8NT and the 
claimed bound holds. 

We relabel the sequences a_£j , . . . , o_i, ai, . . . , a^, ak+i, ■ ■ ■ , ak+i2 and b-i^ , ■ ■ ■ , 
6i, . . . , &fc, . . . , fofc+fj where ^i, ^2 > 1 and k > Q so that Oi, bi > 1 for 
1 < i < fc. Let N and A be the corresponding summations as defined before. 
We can reformulate the minimization problem as follows. 

N 

mm^ 

s.t. a, = 1 = bk+j,ak+j =T^b„ - £1 < i < -1, I < j < £2 

a,bi = T, Vi 

a,,b,>ll<i<k (9) 

Solving this optimization problem gives the following conditions for the solutions 
(see Proposition B.l in the Appendix [b] for the detailed calculations). The 
sequences &i<i<fc and (hence ai<i<fc) are a geometric series with 

^ _ ^ _ _ bk-i 
b2 bk 

and 

_ 02 _ _ Qfc 
fli ak-i 
and the ratio between successive terms c > 1. Also, 

6fc = (c-lK2. 

and 

Substituting, we also have that 

&i=c'=-i6fe = c'=-i(c-l)4 

T = ai&i = c'=-i(c- l)2^i4 (10) 

Since ai > 1 we have 

01 



max{^i,4} ' 
Furthermore, c'^~^ < T and therefore 



InT ^ InT 

h^ - ln(l + l/max{4,^2})' ^ ' 
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In the next step we will show the w.l.o.g. we may assume £i = £2 = 1- Let 
^0 = El<^<J<k azbj +J:^b^ + E^a^ + l + 2T and Ao = Eti a^b^ + 2T. These 
are the values of the summations with the first £1 — 1 and last £2 — 1 members 
of the sequences removed. Then 

Aih 



N = No + T- 



1) , y^2(^2 



+ i£i - 1) ^ + {£2 -l)Y, + (^1 - 1)(^2 - 1), 

1=1 i=l 

A = Ao+r(£i+4-2) 

We have above that ^1,^2 > 1- We will show that the optimal of 32rinTA^- A^ 
when A^o and Aq are fixed is at £1 = ^2 = 1 by showing 

32rinT7V - A^ > 32rinTA^o - Ag. 



Wlog, suppose that £1 > £2 so that £1 > 2. Therefore by ( 11 ) and using the fact 
that for X < 1/2, ln(l + x) > x/2, we have 



1 < 



InT 



ln(l + l/max{£i,^2}) 



< 21nrmax{^i,£2}- 



(12) 



Now, we have 



£^ (£^ - I] 

32TlnT(iV - Nq) > 32T\nT{T^^ '-). 



On the other hand, by the bound from ( 12 ) on fc 
A^ - A^ = 2AoT(^i +£2-2 



T^{£i+£2-2y 
= 2kT^{£i +£2-2) + T^{£i +£2- 2f 

< 4kT^{£i - 1) + 4T^{£i - 1)2 

< 12T^ lnr4(^i - 1) +4T2(£i - 1)2 

< ler^lnT^l^i - 1) 



< 32r In T(T 



£i{£i - 1) 



< 32rinT(A^- TVo) 

as required. Finally, if ^1 = ^2 = 1, then using the fact that bi < T, from (10), 
we obtain that c > 2. Hence by ^ k < 2lnT + 1. Note that if £1 = £2 = 1, 
A = (fc + 2)T. We can then use the following straightforward bound. 

N 1 _ 1 1 
A2 - A ^ (fc + 2)T - 4T\nT' 



Theorem 13^ now follows. 



□ 



20 



As we show next, the upper bound of Lemma 3.6 is tight. To construct a pair 
of diagrams where the Ui and bi are integers and A — fl{\/NT InT) we argue as 
foUows. Let T = 2*^, c = 2, 61 = 1 and bk = T. Then A is at least n{VNTWT). 
However, the tightness of Theorem |3.3| does not follow from this since it is not 
clear that there exist corresponding permutations. 



4 Conclusions 

A number of interesting directions remain for further research. 

Characterize extremal permutations. The permutations constructed in 
Section [2] achieve the maximum difference in the shapes for one transposition. 
There it was possible to construct the examples by carefully arranging the in- 
creasing and decreasing sequences. On the other hand, with the help of a com- 
puter, we observed several other examples whose structure we do not completely 
understand. We know that for A to achieve the upper bound, by Greene's Theo- 
rem, the permutations must be decomposable into unions of increasing sequences 
whose sizes are given by the required shape of the diagram. An example of such 
a pair of permutations from simulation for n = 18 is: 

13 14 10 15 6 1 18 2 16 9JI 12 3 7 17 8 4 5 
13 14 10 15 6 1 18 2 16 11 9 12 3 7 17 8 4 5 



Notice that in this example the permutations cannot be decomposed into con- 
jugate increasing and decreasing sequences as done in our construction. In our 
view the class of such permutations is an intriguing mathematical object. We 
would like to know how many such permutations exist, what their structural 
properties are etc. This seems like a good subject for further work in this area. 



Constructions for t > 1 transpositions. As mentioned, we do not know 
whether there exists a pair of permutations corresponding to the diagrams which 
are tight for Lemma |3.6| We do not see how to extend our construction for one 
transposition to this case. 

Secondly, our constructions achieve Cl(y^nt/2) differences when t < n/2. The 
behavior for larger t is still unclear. For example, the maximum possible value 
of A is n — 1, and this is uniquely achieved with t = (") transpositions. We also 



know from Theorem 3.1 that we can make A > Q,{n) with t < 0{n). We still 



do not know how large t should be to make A > an with a close to 1. 

Dependence on transpositions. It would be interesting to obtain more de- 
tailed information about the change in A as a result of left-multiplication with a 
transposition. Knuth and Knuth-dual equivalence classes characterize transpo- 
sitions which keep A fixed. What is the expected change in A for a transposition 
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in a random permutation? How do the position of the transposition or proper- 
ties of the permutation affect the change? 

Other metrics. In this work we studied the adjacent transposition metric 
on permutations but there are a number of natural measures for the distance 
between two permutations which may be worth studying in this setting. 
For example it can be verified that up to constants, the same bounds on the Lip- 
schitz constant hold for the distance d! on permutations with respect to general 



(not necessarily adjacent) transpositions. The lower bounds from Theorems 2.5 
and |3.1| hold since the constructions give permutations tt and r which differ by 
1 and t transpositions respectively. On the other hand, the upper bounds on 
the Lipschitz constant follow (and are within a constant factor of the bounds for 
adjacent transpositions) since by Greene's Theorem the bounds in Proposition 



|2.1| change only by a small additive constant and in Lemma 3.4 this translates 
to the absolute value of the difference between the sums being bounded by 2t if 
the permutations differ by the multiplication of t transpositions. 
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A The Method of Lagrange MultipHers 

The method of Lagrange multiphers is used to solve for the maxima or minima of 
a real- valued multivariate function subject to equality constraints. In particular, 
the method gives necessary conditions for optimality which are the analog of 
the conditions on the gradient for unconstrained problems. The Karush-Kuhn- 
Tucker (KKT) conditions for optimality generalize these to the case when some 
of the constraints may be inequalities. Consider the following optimization 
problem, where x = {xi,. . . , a;„) and aj,(3k € K: 

min/(x) 
s.t. gj{x.) <aj, j = !,...,£ 
hk{-x.) = I3k k = l,...,m. 

The Lagrangian for this problem is defined to be the function: 

£ m 

C := C{x,Xj,Hk) = /(x) -^Xj{gj{x) - aj) - ^ Mfe(/ife(x) - Pk)- 

i=i fc=i 

The KKT conditions say that if a local optimizer x* satisfies certain technical 
"constraint qualifications" (explained below) then there are constants (j = 
!,...,£) and /j.^ (fc = 1, . . . , m) satisfying 

£ m 

V£ = V/(x*) - ^A*V5,(x*) - ^MfcV/ife(x*) = 

A* >0, j = !,...,£ 
A*5i(x*) = 0, j = l,...,t 

A number of constraint qualifications arc known to be sufficient for the result 
and in our case, the so-called Mangasarian-Fromovitz constraint qualification 
holds. This condition requires that at x* , the gradients of any active inequality 
constraints and the gradients of the equality constraints are positively-linearly 
independent. A collection of vectors (wi, . . . ,Vd) is positively-linearly dependent 
if there are oi > 0, . . . , > 0, not all such that ^ • aiVi = 0. for the 
optimization problems we consider, the constraint qualification can be verified 
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without much difficulty for any possible set of active constraints, so we leave 
this to the reader and assume that the KKT conditions are satisfied. For more 
details regarding the method of Lagrange multipliers and extensions, the reader 
may refer to [2]- 

B Solution to the Minimization Problem 

We show below the calculations that solve the minimization problem in (|9| 
which is reproduced below. 

TV 

mm^ 

s.t. = 1 = bk+j,ak+j = T = bi, - li < i < -1, I < j < (2 

a.ibi = T, Vi 
o-i,bi > 1 1 < i < k 

Proposition B.l. At a minimum of the optimization, the sequences bi<i<k 
and (hence ai<i<k) are a geometric series with 

c - ^ - ... - ^fe-i 
b2 bk 

and 

^ _ 02 _ _ gfc 



ai Ok-i 
and the ratio between successive terms c > 1. Also, 

bk = {c-l)l2- 

and 

ai = (c-l)4. 
Proof. We obtain the following Lagrangian. 

k k 



min£ =^ - '^\i{aA - T) - ^/^^(aj - 1) - - 1) 



i=l 2=1 



From the KKT conditions for optimality, we obtain: 
d d N 

-/:=-^-A.6.-^. = 0, l<^<k (13) 

d d N 

^/:=^^-A.a,:-..=0, l<^<k (14) 

Mi > 0, ^ii{ai - 1) = 0, 1 < z < A; 

> 0, - 1) = 0, l<i<k 
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As outlined before, we may assume that the optimal solution is such that Ui.bi > 
1 for 1 < z < fc. Hence by the conditions above, fii — Vi — Q. Performing the 



differentiations in (13) (w.r.t. a,) and (14) (w.r.t. hi) and multiplying them by 
ai and bi respectively we obtain the following relations. 



ajbiN 



— XiUibi — 



at{bi^ h bk + bk+i H h fofe+fj 



= 0, l<i<k (15) 



aib,N 



A4 



— XiUibi — 



bi{a^i^ H h a_i + , 



A4 



= 0, l<i<k (16) 



Equating ( 15 ) and ( 16 ) and cancelling terms, we conclude that since A 7^ 0, 



ai{bi + h bk + £2) = h{li + ai H h a,), l<i<k 



(17) 



We can solve the above set of relations as follows. Dividing (17) by and 
using the equations corresponding to i and i + I there, and that aibi ~ T, after 
rearranging terms we obtain the following relations: 



bi + bi+i 



Subtracting we obtain 



ai 

bi+i 



bi 



(bf bf^,) + 



f ai H h 


a,) = hf 




+ fli+i) 








(-^ 


1 r 




bi "' hy 


) - bi+i, 



1 



1 

1 

bi' 



1 

bi+i 



1 

bi+i 



Rearranging, 



Let 



6, I £1 + ^ + 



so that 



bi 



1 

bi+i 



l<i< k 



(18) 



~ Hi — H, 



i-l- 
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Rearranging ( 18 1 and manipulating both sides, we have 



bt+i h 

Hi{Hi+i — Hi) = Hi+i{Hi — Hi^i) 

Hi _ 
Hi -I Hi 
bi-i h 



bx+i 



1 <i < k. 



In other words, we can conclude that bi-a-ck (and hence ai<i<fc) is a geometric 
series. Let 

bi bk-i 



and 



b2 

02 



bk 

ak 



It can be checked that c > 1 since for c = 1 ( 18 ) is not satisfied 



Next, suppose we multiply the equation ( 17 1 by c — 1, we obtain 



(c - l)a,(6, + --- + 6fc+^2) 
ai{chi - bk + {c- 1)^2) 



= (c - l)bi{£i + fli + • • • + flj 
= bi{cai - oi + (c - 1)^1 ) 

-(-6fc + (c-lK2) --ai + (c-lKi 



Since the right hand side of the last equality is the same for all 1 < i < fc, from 
i — 1 and i — k, we obtain 



h bk 



(-6fc + (c-lK2) = 



Now since c > 1, 1^ 7^ and therefore 

' fcl ' bk 

bk = (C- 1)^2. 

By similar arguments, 



ai = {c-l)£i. □ 
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